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1 

DESCRIPTION 

Probes for the detection of RFLP in eucaryotic genomes. 

5 FIELD OF THE INVENTION 

This invention relates to the field of molecular 
genetics. More specifically, this invention relates to 
polynucleotides useful for nucleic acid hybridizations, 
methods for producing these polynucleotides, and methods 
10 for applying these polynucleotides in genetic analysis. 

BACKGROUND OF THE INVENTION 
Double stranded DNA is the most common form of 
depository of genetic information of organisms. Double 

15 stranded DNA has two complementary strands. Each strand 
is a polynucleotide sequence and the base sequences on the 
two complementary strands form Watson-Crick base pairs. 
The duplex structure of DNA can be disrupted in a number of 
ways, for example, by heating a duplex DNA solution in a 

20 0.1 M NaCl to 100 *C for a few minutes. At this 
temperattire , the two strands of duplex DNA separate • I f 
the solution is gradually cooled, the two strands of 
duplex DNA can re-associate to reform the duplex structure. 
The process of duplex formation from complementary 

25 polynucleotide or oligonucleotide sequences has been used 
advantageously for genetic analysis. Typically, a labeled 
polynucleotide or oligonucleotide sequence is used in a 
reassociation process whereby it forms a duplex structure 
with a substantially complementary sequence from a genetic 

30 source of interest. Because the labeled polynucleotide or 
oligonucleotide sequence is normally, though not 
necessarily, obtained from a source other than the source 
of interest, the process of association between 
complementary sequences has been known as nucleic acid 

35 hybridization, or just hybridization for short. The 
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The genetic information which can be adduced using 
"single-copy" DNA probes depends on the number of v^obes 
used the number of genetic loci each probe is capable of 
detecting, the heterozygosities and the allele frequency of 
5 the relevant genetic loci. To date, "single-copy" DNA 
sequwnces are known to detect only a single locus per 
sequence. Moreover, heterozygosity of DNA in higher 
organism is low. In man, it is about 0.001 per base pair. 
Finally, most polymorphic states detected are only 

10 dimorphic (i.e. there are only two representational states: 
absence or presence of a relevant restriction site on the 
restriction fragment in question). As is often the case, 
critical individuals in a genetic analysis are honozygous, 
and the genetic analysis may be uninformative. 

15 Genetic analysis in higher organisms has been 

simplified considerably by the availability of probes for 
hypervariable regions of genomic DNA. These hypervariable 
regions show multi-allelic variation and high 
heterozygosities. These regions, also appear to be widely 

20 interspersed within the genome. In each case, the 
hypervariable region comprises a variable number of tandem 
repeats of a short sequence (thus. Variable '"andem Repeats 
or VTR), and polymorphism results from allelic differences 
in the number of repeats at a given locus. This type of 

25 polymorphism, a subclass of RFLP has been called VTR 
Polymorphism. It is believed that the variation in repeat 
number arises by mitotic or meiotic unequal exchanges or by 
DNA "slippage" during replication. Therefore, if genomic 
DNA is digested with a restriction endonuclease which does 

30 not cut within the repeat unit, and if a genetic locus 
encompasses a variable tandem repeat or VTR, allelic 
markers would exist for that locus. (It should be noted 
that the so-called repeat unit is a hypothetical consensus 
sequence, and any actual VTR sequence in the genome is 

35 really a string of short "core" sequences, each of which is 
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ve y highly homologous,, but usually not identical to the 
consensi 3 sequence. Indee.-, a "core" sequence may dif '-.r 
in length from the consensus sequence. The consensus 
sequence is derived from examining and "averaging" over a 
5 large number of "core" sequences. A "core- seruence is 
tyrically at least 70%, but often more than 70%, homologous 
to the consensus sequence.) 

Jarman et al. have described a hypervariable region of 
DNA located 8-lcb downstream of the human alpha globin 

10 complex. The EMBO J. 5: 1857-63 (1986). This 
hypervariable region is composed of an array of imperfect 
17-bp tandem repeats, the number of which differs 
considerably (70-450) from one allele to another. Thus, 
this locus is highly polymorphic. Genetic polymorphism 

15 which reflects variations in the number of such tandem 
among individuals has been called Variable Tandem Repeat 
Length Polymorphism. 

The VTR ■ described by Jarman et al., supra, cross- 
hybridizes with other hypervariable genetic loci at low 

20 stringency. Thus a polynucleotide probe prepared from this 
region is potentially a very powerful probe, capable of 
probing many genetic loci in a single try. 

A. typical RFLP analysis involves digesting target 
genomic DNA with a restriction endonuolease, separating the 

25 digested DNA by gel electrophoresis, transferring the 
fractionated DNA in a denatured state to a binding surface, 
hybridizing the transferred DNA with a suitable probe, 
detecting the signals generated by the probe molecules 
which have become hybridized to the target DNA. The 

30 pattern of the signals generated would provide information 
about the target DNA. The pattern of signals can also be 
stored for later use, for instance, to determine or confirm 
an individual's identification (i.e., the -pattern would be 
the individual's genetic fingerprint). 

35 More commonly, two or more target DNA's are processed 
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for RFLP analysis, Denendlng on the sources of the target 
DNA, the Information g .nerated by comparison of t' e 
pattern can be used immediately as in the case of genetic 
identity (e.g,, identification of a suspect of a crime), 
5 or in the case where a high ^egree of genetic relatedness 
is present (e.g., paternity testing, sib analysis and the 
like). In other cases, che information derived from 
pattern comparison may for-n a part of a larger 
information-gathering effort. Pedigree analysis of 

10 distant relatives and correlation of a gene of genotype 
with a trait or medical condition are but two examples. 

However the RFLP analysis is to be used, the pattern 
of signals is controlled in large part by the probe or 
probes used in the analysis. A polynucleotide probe may be 

15 useful for any of a number of features. 

First, a probe may be able to detect polymorphism at a 
locus that other probes caimot detect. The locus may be 
particularly useful for genetic analysis in the general 
population because it has many evenly distributed alleles. 

20 Alternatively, the locus may be particularly useful for 
genetic analysis in a highly restricted segment of the 
population because it has a rare allele. 

Second, a probe may be able to detect many loci 
simultaneously and unambiguously when a particular 

25 restriction endonuclease is used to digest the target DNA. 

In this connection, it is useful to note that certain 
restriction endonucleases may be preferred because of the 
history of the target DNA samples, e.g. forensic samples 
Which has been exposed to the elements for an extended 

30 period of time. 

Third, probes are often used in combination 
simultaneously because their resolving power may be 
compounded. Compounding is obtained when the signals 
produced by the several probes do not overlap and permit 

35 unambiguous assignment of each (or substantially each) 
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signal to an allele of a locus. See, e.g., "The 
Application Of DNA-Prlnt For The Estimation Of Pateml :y", 
Baird et al^ in Advances in Forensic Haemogenetics 2: 354- 
358, Springer-Verlag, New York (19B7). 
5 Row RFLP phenotypes can be practically applied for 

paternity and forensic determinations have been discussed 
in Baird et al., supra; Baird et al. (II), "The Application 
Of DNA-PRINT™ For Identification From Forensic Biological 
Materials", in Adv. in Forensic Haemogenetics 2: 396-402, 

10 Springer-Verlag, New York (1987); and Baird et al. (Ill), 
Am. J. Hum. Genet. 39:489-501 (1986) and citations therein. 
These papers are hereby incorporated by reference. 

Reference has been made earlier in the instant 
disclosure that hybrid formation can take place even where 

15 there is a certain degree of mismatch between a probe and 
its substantially complementary target sequence. This 
process is particularly Important for the utility of 
multllocus probes. Such probes generally form well-matched 
h^rids with target sequences which originate from the same 

20 genetic locus as the probe, but they form less well-matched 
"cross-hybrids" with target sequences from other loci. As 
a result, the loci that can be analyzed with a given probe 
may vary significantly with the reaction (association and 
washing) conditions of the hybridization test. Thus, many 

25 loci are detectable under low stringency conditions, but 
ox^y a single locus is detectable under high stringency 
conditions. Therefore, a polynucleotide sequence which is 
capable of probing multiple polymorphic loci even under 
high stringency conditions represent an additional bonus. 

30 Strictly speaking, stringency of conditions has two 

components: conditions which govern formation of the 
h^rlds and conditions which govern the stability of (the 
duplex structure of) the hybrids. Typically, however, a 
hybridization test is performed at low (or relaxed) 

35 conditions during the hybrid formation phase to speed up 
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the process of association. Therefore, for the purpose of 
this applica^. Ion, stringency of ^nditlcis refer solely to 
conditions «fhlch govern the stability of the hybrids. (Of 
course, it a hybrid is not stable under a given set of 
5 conditions, it would not be formed in the first place under 
those conditions.) 

The factors whicL govern the stability of a hybrid are 
many, including, but not limited to the temperature, the 
ionic strength, the molecular species of the salts used, 

10 the degree of modification or elimination of bases on a 
polynucleotide sequence, the degree and nature of mismatch, 
and the length and type of polynucleotides sequence. 
Variation in one factor may be compensated or aggravated by 
variations in other factors. These and other relevant 

15 facts are well known to a person of ordinary skill in the 
art of molecular genetics. 

For the purpose of this invention, low stringency 
conditions mean an aqueous environment containing about 2X 
SSC at about 50-65'*C, or the equivalents thereof; and high 

20 stringency conditions mean an aqueous environment 
containing about O.IX SSC or less at about 65 ""c, or the 
equivalents thereof. [For formulation of IX SSC, see 
Example 3 in Section 6 below] . 

For the purpose of this invention, a "discrete poly- 

25 nucleotide sequence or subsequence" means a polynucleotide 
sequence or subsequence of greater than 15 nucleotides, 
but preferably greater than 50 nucleotides, and very 
preferably greater than 100 nucleotides; and a 
polynucleotide means a chain of about 15 nucleotides or 

30 more, and embraces the upper range of what sometimes passes 
as oligonucleotides. 

Many "single copy" DNA probes are known in the art. 
These probes do not relate closely to the present 
invention because their utility is generally (1) limited 

35 to providing genetic information at a single locus; and 
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(2) limited to detecting polymorpW-sm caused by alteration 
of a restriction site isi the neighborhood of the target 
cfenonL.c sequences. The polym. rphic probes of the present 
Invention are of the VTR type and do not suffer from these 
5 limitations. 

Polymorphic probes of the VTR type have also been 
described. However^ the hybrids fcrmed between a VTR probe 
and its target genomic sequences tend to be stable only 
under low to moderate stringency conditions, except for 

10 hybrids between the probe and target sequences from a 
single genetic locus. See Nakamura et al. (I), Science 
235: 1616-1622 (1987)? Jeffreys et al.. Nature 314:67-73 
(1985). The exceptional hybrids are stable even under 
high stringency conditions, possible reflecting the fact 

15 that the probes originated from this locus. 

Sometimes a genetic locus detectable with a VTR type 
probe may be very large,, spanning several hundred 
kllobases. In a restriction fragment length polymorphism 
analysis of such a large locus, a VTR can yield many 

20 polymorphic bands under high stringency conditions. 

However, the information which can be derived from such an 
analysis remains confined to the one locus. xn fact, VTR 
probes for loci of this kind have disadvantages. First, 
recombination within the large locus (which is expected to 

25 be more frequent than a similar but smaller locus) can 
complicate data analysis. Second, to obtain more extensive 
Information than is obtainable front only a single locus, 
two or more probes are preferably used in combination. The 
multiplicity of non- informational bands from the large 

30 locus may obscure bands detected by other probes used in 
combination, thereby making data analysis very difficult. 
The alternative to using probes in combination would be 
more costly multiple analyses of restriction fragment 
length polymorphism. Therefore, the difference between a 
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probe for a large single locus and a probe for multiple 
loci Is substantive and net merely semantic. 

BRIEF SUMMARY OF THE INVENTION 
5 The subject Invention concerns the following^ 

(1) polynucleotide sequences useful for detecting 
polymorphism In a species of organism of Interest, or a 
subpopulatlon thereof; 

(2) a polynucleotide sequence useful for detecting 
10 polymorphism at multiple genetic loci, and characterized by 

Its ability to form hybrids with restriction fragments of 
DNA, of 7.1, 6.2, 4.4, 4.2, 4.1, 3.7, 3.6, 2.6 and 2.2 
kllobases, produced by PstI digestion of genomic DNA 
extracted from K562 cells; 

15 (3) a polynucleotide sequence useful for detecting 

polymorphism at multiple genetic loci and capable of 
forming hybrids with generic DNA fragrments produced by 
complete digestion of Caucasold, American Black or Hispanic 
genomic DNA with the restriction endonuclease Psti of 

20 approximate allelic lengths and allelic frequencies as 
given In Table 2; 

(4) a polynucleotide sequence useful for detecting 
polymorphism at multiple genetic loci with high 
specificity. I.e., the polynucleotide sequence forms 

25 hybrids with genomic sequences at multiple genetic loci 
which remain stable even under high stringency conditions; 

( 5 ) a method for determining or obtaining a 
polynucleotide sequence useful for detecting polymorphism 
at multiple genetic loci with high specificity; 

30 ( 6 ) the use of the above-described polynucleotide 

sequence as a probe for polymorphism; 

(7) the use of the above-described polynucleotide 
sequence as a probe for polymorphism at multiple genetic 
loci; * 

35 (8) a method of genetic analysis comprising: 
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(a) digesting p DNA sample with a restriction 
endonuolease ; 

(b, separating the DMA rc itriction fragments 
according to size by electrophoresis; 
5 (c) transferring the taparated DNA to a binding 

surface; 

(d) hybridizing the transferred JNA with a 
polynucleotide probe labeled with a signal- 
generating moiety, wherein the polynucleotide 

10 probe is a polynucleotide probe of the present 

invention; and 

(e) detecting the signal generated; whereby the 
pattern of signals generated provides information 
about the composition of the DNA sample; and 

15 (9) recombinant vectors and cells useful for producing 

polynucleotides of the invention. 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 shows the electrophoretic pattern of 

20 restriction fragments of DNA's from recombinant lambda 
phage 19 which was digested with EcoRI (Figure lA), and 
thB results of a Southern hybridization blot when the DNA 
fragments were transferred and probed with oligo-5'- 
CCCCCXX3T6TCGCTGTT-'3' (Figure IB). 

25 Figure 2 shows the results of 2 hybridization blots. 

PstI digested human genomic DNA from a group of 4 
unrelated individuals were probed with either DNA from 
Bands C of Figure lA (Figure 2A)^ or DNA from Band D of 
Figure lA (Figure 2B). 

30 Figure 3 shows the results of a hybridization blot 

where a panel of PstI digested human genomic DNAs from five 
unrelated individuals was hybridized with either the insert 
of pAC329 (Figure 3A), or the insert of pAC344 (Figure SB). 
Figure 4 shows the electrophoretic patterns of the 

35 Rsal digests of the inserts of pAC329 and pAC344. 



V 



wo 90/02818 



PCrAJS89/03473 



Figure 5 shows the results of a hybridization blot 
where PstI digested htiman genomic DNA was probed with Rsal 
restriction fragments of the inserts of pAC329 and pAC'^44. 
Figures 5A, 5B, 5C and 5D shows the results when Band 1^ 
5 2, 3 or 4 of Figure 4 respectively was used as probe. 

Figure 6 shows the restriction map of pAC365. 
Figure 7 shows the results of a hybridization blot 
where PstI digested human genomic DKA was probed with the 
insert of pAC365. 
10 Figure 8 shows the electrophoretic pattern of the 

restriction fragments produced by digesting the 1.35 kbp 
human Insert of pAC365 with BstKI. 

Figure 9 shows the results of a hybridization blot 
where PstI digested human genomic DNA was probed with the 
15 EcoRI-PstI (Figure 9A) or the EcoRI-NcoI (Figure 9B) 
subfragment of the 1.35 kbp htiman insert of pAC365* 

Figure 10 shows the results of hybridization blots 
where PstI digested human genomic DNA was hybridized with 
the insert of pAC365 under (1) low associational stringency 
20 and low stringency wash (Figure lOA); (2) low associational 
stringency and high stringency wash (Figure lOB); and (3) 
high associational stringency and high stringency wash 
(Figure IOC). 

Figure 11 shows the results of hybridization blots 
25 where PstI digested human genomic DNA was probed with the 
1.35 kbp hiunan Insert of pAC256, and washed under either 
low stringency conditions (Figure llA), or high stringency 
conditions (Figure IIB). 

Figure 12 shows the comparative banding patterns in 
30 hybridization blots where PstI digested human genomic DNA 
was probed with either "Band C DNA" (Figure 12A), or with 
19-MSP (Figure 12B). 

Figure 13 shows the results of a hybridization blot 
where PstI digested genomic DMAs from a family spanning 
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three generations were probed with the 1-35 kbp human 
Insert of pAC365. 

Figxire 14 shows the allelic diPtrll Jtions of genetic 
loci detectable by pAC365 in American Blacks (Figure 14A), 
5 Caucasoids (Figure 14B), and Hispanics (Figure 14C). 

Figure 15 shows the results of lybridization blots for 
qenetic identifica^-ion purposes. pAC365 insert was used as 
probe in a paternity test (Figure 15A), and a forensic test 
(Figure 15B>* 

10 Figure 16 shows the relationships among the human 

sequences of the invention. 

DETAILED DESCRIPTION OF TOE IKVENTION 
One embodiment of the instant invention is a method to 

15 obtain polynucleotide sequences useful for detecting 
polymorphism in a species, or a subpopiilation thereof. A 
library of genomic DNA digested with one or more 
restriction endonucleases and cloned in a suitable 
recombinant vector is screened witti a polynucleotide probe 

20 which comprises a string of "core" sequences (hereinafter 
"screening probe* ). This string of "core* sequences can, 
but need not be, a monomer, an oligomer or a polymer or a 
mixture of oligomers and polymers of a consensus sequence 
or "core" sequence of a VTR. Preferably, the screening 

25 probe is a mixture of oligomers of a consensus sequence, 
because a short consensus sequence can be easily 
syntiiesized chemically in large amounts and ligated to form 
a mixture of oligomers. In a preferred embodiment, the 
consensus sequence is 5 ^ -CCCCCCGTGTCGCTGTT-3 ' . 

3Q For the purpose of generating a genomic library, it is 

preferred that the restriction endonuclease digestion of 
genomic DNA be incomplete. One reason, is that many genomic 
VTR sequences may otherwise evade detection. This would be 
so if the relevant restriction endonuclease cuts within the 

35 VTR sequences, and the bulk of the VTR sequences will be in 
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relatively small pieces • The smaller the pieces , the 
greater the number of recombinant molecules which must be 
studied so that -^he hiuaan ge*^ome will be covered. For the 
same reason, it is preferred that a recombinant vector 
5 which can accommodate large DNA insert be used. Finally, 
wher«d the recombinant vector has a restricted cloning 
range, incomplete digestion of t^e genomic DNA would also 
tend to avoid under«-representation in the library of 
completely digested products which are smaller than the 

10 preferred cloning sizes. 

The recombinants which react positively with the 
screening probe in a hybridization test (hereinafter 
"positive recombinants") are selected for further 
examination. In a preferred embodiment, the recombinants 

15 are bacteriophages. The standard method of "phage lifts" 
can be used to identify the recombinants containing DNA 
inserts which hybridizes to the probe. See Maniatis et 
al.. Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratories, Cold Spring Harbor, New York, (1982). 

20 Briefly, a portion of a phage plaque is transferred to a 
nylon membrane where the DNA of the phage is immobilized 
and probed. Many plaques can be transferred in a single 
lift; moreover, the position of a plaque on the growth 
plate is in a one-to-one correspondence with the position 

25 on the membrane, thus permitting identification of the 
plaques which give rise to positive results in a 
hybridization test with the probe. Obviously, many 
variations of this basic technique can be designed with 
other cloning and/or transfer and/or identification 

30 systems . 

Once the positive recombinants have been identified, 
they can be subjected to tests which prove or disprove 
their utility. They are used as probes in hybridization 
tests against genomic sequences of a species of organism of 

35 interest, or a relevant subpopulation thereof. While the 
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present Invention broadly encompass eukaryotic organisms, 
on'" of the more commercially significant use is that of 
probing nammalian gjtvomes, particularly, the human genor^,. 
In any case, it is very highly preferred that the probe 
5 sequences of tne present invention be derived from the same 
species of organisms as the genetic materials which are to 
be tested in a hybridization tept. Thus, for applications 
of human genetic analysis, the starting library should 
preferably be a human genomic library. To avoid 

10 verbosity, the embodiments of this invention are described 
as if they apply to humans specifically . The present 
invention is not so limited, and is to be construed to be 
applicable generally to mammals and other eukaryotes. 

The useful positive recombinants are those which can 

15 detect Variable Tandem Repeat Length Polymorphism in 
humans, at multiple loci, and under high stringency 
conditions. Genomic DNAs from family members are 
separately digested with a restriction endonuclease, the 
digests are separately subjected to size fractionation by, 

20 for example, electrophoresis, and the fractionated 
restriction fragments are prepared for hybridization in 
any standard method. Positive recombinants or the human 
sequences or discrete polynucleotide subsequences inserted 
therein (Jointly and severally "test sequences") are used 

25 to probe the restriction digests. Preferably, a single 
test sequence is used at a time. However, several 
sequences can be grouped together in pr e li m inary tests to 
determine whether the group as a whole contains any useful 
sequences. 

30 The hybridization "banding" pattern for each 

individual member is determined. In particular, the sizes 
of restriction fragments which hybridize to the test 
sequences are determined. The segregation scheme of each 
band within a family or, more commonly, a number of 

35 families will inform as to the nature of the genetic locus. 
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(loci) being detected The nature of a genetic locus 
Includes, but Is not Irnlted to, the following: 1 . 
Mendeli?n or non-Mendellan segregat'on; 2« phenotype and 
frequency of alleles (reflected by the size of restriction 
5 fragments produced by the restriction endonuclease used to 
digest the genomic DNA, and the frequency of occurrence In 
a population); 3. linkage \.o another locus {reflected by 
co-segregation with other bands ^. If the test sequence 
detects multiple baxids which segregate Independently, It is 

10 capable of detecting multiple genetic loci. If test 
sequence detects bands of a locus which bands represent 
different-sized fragments among different individuals, it 
is capable of detecting polymorphism in that polymorphic 
locus. The determination of the nature of the locus (loci) 

15 detected by a test sequence from the segregation scheme is 
a straight forward application of classical genetics, and 
is well within the command of a person of ordinary skill in 
the art of molecular genetics. The size of families, and 
the number of families . needed to provide sufficient 

20 information to work out the segregation scheme would vary 
with the number of genetic loci being detected by the test 
sequence, the number of alleles in these loci, and the 
frequency of each allele. An ordinarily skilled artisan 
would also know how to determine the number and size of 

25 families to be studied. 

Another embodiment of the instant invention is the 
test sec[uences which can detect polymorphism in an species 
of organism of interest, or a subpopulation thereof 
Chereinafter'*useful test sequence"). 

30 In another embodiment of the instant invention, 

useful test sequences are cloned in recombinant vectors. 
In another embodiment, the recombinant vectors comprising 
the useful test sequences are harbored in a cell. 
Molecular cloning and transformation methods are well 

35 known in the art. 
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In a preferred embodiment:, the useful test sequences 
are such that the hybrids formed between tiie test sequence 
and genomic sequences from at least two genetic loci are 
stably associated under high stringency conditions* 
5 Because the segregation scheme is bath lengthy and 

expensive to workout, it is sometimes preferably to defer 
the study of segregation until a test sequence has been 
better characterized. Thus, it may be preferable to modify 
the method described hereinabove, namely, to use instead of 

10 genomic DMAs from members of families, merely genomic DMAs 
from random, unrelated individuals. If the banding pattern 
appear to vary from individual to individual, the test 
sequence Is presumptively treated as being useful as a 
polymorphic probe. If the number of bands detected in each 

15 of several individuals is significantly greater than one or 
two, the test sequence is presumptively treated as being 
useful as a multi-locus probe. 

The presumptively useful test sequence is analyzed, 
and less desirable sequence(s) are removed to produce an 

20 improved test sequence. For example, a test sequence may 
comprise a subsequence which is polymorphic as well as a 
subsequence which is non-polymorphic in the relevant 
population. The presence of non-polymorphic bands yields 
no useful genetic information about a human individual, but 

25 can Interfere with genetic analysis by, for example, 
obscuring a partially or totally informational band 
detected by a second probe used in combination with the 
test sequence. Another example is where the test sequence 
comprises a subsequence which is a hijgrhly repetitive 

30 sequence in the human genome- An example of a highly 
repetitive sequence is the "Alu sequence". See Houck et 
al., J. Mol. Biol. 132: 289 -306 (1979). Presence of such 
highly repetitive sequences in a probe often cause a high 
non-informational background signal in a hybridization 

35 blot. This background signal can be avoided by eliminating 
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the highly repetitive sequence component from the test 
sequence. See, for example, Sealey et al*, Nuc. Acids 
Res., 13: 1903-1922 (1985). 

Still another example is where the test sequence 
5 comprise a first subsequence which delivers only a small 
signal in a hybridization blot relative to the signal 
delivered by a secr^nd subsequence. Here, it may be more 
advantageous to eliminate the first test subsequence so 
that a more "cost-effective" probe which delivers a higher 

10 signal on a per nucleotide basis may be produced. A more 
specific example of this type is where the first 
subsequence is a ** single-copy" sequence , and the second 
subsequence is a Variable Tandem Repeat Sequence. On a per 
nucleotide basis, the second sequence delivers more signal 

15 whenever there are more than a single copy at a genetic 
locus. 

Discrete polynucleotide subsequences may be obtained 
from a test sequence in a number of ways, and are well 
within the capability of an ordinarily skilled artisan. 
20 For example , one end of the test sequence may be 
progressively removed by an exonuclease or SI enzyme, 
while the other end is being protected. Another example 
is digestion with a restriction enzyme. Other methods of 
obtaining subsequences are within the contemplation of the 
25 present invention. 

After the less desirable subsequences have been 
eliminated from a test sequence, the remaining portion of 
the test sequence is used in familial tests for the 
determination of the nature of the genetic locus (loci) 
30 which it is capable of detecting as described above. 

Finally, nearby genomic sequences (including nearby 
VTR sequences) may be reached by chromosome walk. 

As discussed in the Background section of this 
application, duplex formation and stability depend on 
35 substantial complementarity between the two strands of a 
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hybrid, axid a certain degree of mismatch can be tolerated. 
Pherufore, whenever a test t ^quence obtained as described 
hereinabove has been determined to be useful in probing 
target polynucleotides of interest, mutations (both single 
5 and multiple), deletions, insertions of the useful test 
seguence, and combinations 'hereof, wherein said 
mutations, insertions and deletions permit formation of 
stable hybrids with said target polynucleotide of 
interest, are part of the present invention. Mutations, 

10 Insertions and deletions can be produced in a given 
polynucleotide sequence in many ways, and tiiese methods are 
known to an ordinarily skilled artisan. Other methods may 
become known in the future. The known methods Include, but 
are not limited to: !• determining analytically the 

15 seguence of a test seguence of the present Invention, 
synthesize chemically or otherwise an artificial seguence 
which is a mutation, insertion or deletion of the test 
seguence; 2. using a test seguence of the present invention 
to obtain via hybridization a genomic seguence or otherwise 

20 which is a mutation, insertion or deletion of the test 
seguence; and 3. mutating, inserting or deleting a test 
seguence in vitro or in vivo. 

It is important to note that the mutational, 
Insertlonal, and deletiohal variants generated from a 

25 given test seguence may be more or less efficient than the 
test seguences, in the sense that (a) more or fewer 
genetic loci may become detectable, (b) more or fewer 
alleles of a particular locus may become detectable, (c) 
more or less stable under stringent hybridization 

30 conditions, and (d) any combination of the above. 

Kotwithstandlng such differences in efficiency, these 
variants are within the scope of the present invention. 

In another embodiment of the present invention, the 
useful test seguences described hereinabove are used for 

35 genetic analysis, (i.e., used as probes), including but 
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not limited to analysis of genetic identity, relatedness 
or alteration. In one prefesrred embodiment, the method of 
genetic analysis comprises: 

(a) digesting a DNA sample with a restriction 
5 endonuclease; 

(b) separating the DNA restriction fragments according 
to size by electrophoresis; 

(c) transferring the separated DNA in a state suitable 
for hybridization to a binding surface; 

10 (d) hybridizing the transferred DNA with a useful test 

sequence labeled with a signal-generating moiety, and 

(e) detecting the signals generated; 
whereby the pattern of signals generated provides 
information about the composition of the DNA sample. 

15 This and other embodiments of the present invention 

involve the use of xiseful test secpiences as probes in 
met: >ds of genetic analysis. 



20 



EXAMPLES 



Following are examples which illustrate procedures, 
including the best mode , for practicing the invention . 
These examples should not be construed as limiting. All 
25 percentages are by weight and all solvent mixture 
proportions are by volume unless otherwise noted. 

Example 1 - Detection Of Genomic Sequences Which Hybridize 
With Oligomers Of A Consensus Sequence Of A VTR 

30 Human genomic DNA incompletely digested with the 

restriction endonuclease EcoRI was cloned into the 
bacteriophage lambda Charon 30. Restriction fragments 
ranging from 4.5 kbp to 17.5 kbp can be cloned into this 
vector. Gene 12: 301-309 (1980). About .5000 phage 

35 plaques were screened according to the method of Maniatls 
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Southern format to probe human genomic DMAs from 4 
unrelated individuals. 

The target human sequences were restriction fragments 
produced by digestion with one of the following 
5 restriction endonucleases: PstI, Hinfl, Hlndlll, EcoRI, 
TaqI, Mspl, PvuII, Rsal, and BstNI. 

The following experiment and results were 
representative. 5 ug of PstI digested DKA from each of 4 
individuals were electrophoresed and blotted onto a nylon 

10 membrane. Nylon membranes were prepared in duplicate. 

Either [32p] -labeled DNA from Bands C of Figure lA, or 
[32p] .labeled DNA from Band D of Figure lA was mixed with 
excess unlabeled total human genomic DNA, and used to probe 
the target sequences on the membranes. Radioactive 

15 labeling was achieved by random 6-mer primed enzymatic 
synthesis, using radioactive precursors as substrates. 
However, other methods of labeling would also work as well. 
Total unlabeled human genomic DNA was added as a 
precautionary measure. It was known that the human genome 

20 contains widely dispersed highly repetitive sequences such 
as the Alu sequences. If the human insert in Phage 19 
contained these and/or similar highly repetitive sequences, 
such repetitive sequences would produce a heavy background 
signal over the entire area on the blot where human target 

25 DNA could be found. The introduction of total human 
genomic DNA would serve to suppress this background signal. 
Sealey et al., supra. 

The hybridization was carried out at 65*C in 5X SSPE, 
1-2% SDS (sodium dodecylsulfate) , 0.5-1 mg/ml heparin. The 

30 blot was washed in O.IX BSC, 2.5 mM sodium phosphate, 1% 
SDS at 65'C. [IX SSPE « 0.16 M NaCl, 0.01 M sodium 
phosphate, and 1 mM ethyl enediaminetetraacetic acid. IX 
SSC - 0.15 M NaCl, 0.015 M sodium citrate.] 

Figure 2 shows the results of these hybridization 

35 blots. Band C DNA was used in Figure 2A. Band D DNA was 
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Example 4 — Cloning Of The Hximan "Polymorphic Probe** 
Sequence From Phage 19 

DNA In Band C of Figure lA, representing part of the 
human sequence insert In Phage 19 was extracted, and cloned 
5 Into Bluescrlpt, which Is a trademarked cloning vector of 
Stracagene, San Diego, CA 92121 • Bluescrlpt Is a 
derivative of M13 as well as of nBR322, and Is about Skbp 
long. The EcoRI Insertion site was used. Several 
Bluescrlpt recombinants containing Band C human DNA were 

10 obtained. 

Two clones designated pAC329 and pAC344 were used to 
probe a panel of human genomic DMAs. DNA from 5 unrelated 
Individuals were digested with PstI and two duplicate 
filters were prepared for the Southern procedure. The 

15 hybridization conditions were the same as described In 
Example 3. Figures 3A and 3B show the results obtained 
when the Inserts of pAC329 and pAC344 were used as probes 
respectively. It Is readily seen that the same pattern (of 
multiple bands) were obtained. However, the pAC344 Insert 

20 Is slightly longer, and mbre sensitive. I.e., produces a 
stronger signal under comparable conditions. 

A third Bluescrlpt recombinant was able o detect only 
non-polymorphic bands. I.e. the banding patterns are 
Identical In several unrelated Individuals. (Data not 

25 shown). 

Band C appears to have more than one species of DNA 
sequences each approximately 5 kbp long. This Is 
consistent with the preferred cloning size of the Charon 
30 vector ( see Example 1 above ) • 
30 Tills Example shows that both pAC329 and pAC344 are 

useful as multi-locus, polymorphic probes. 

Example 5 - Analysis Of Human DNAs In pAC329 And pAC344 

pAC329 DNA was digested with EcoRI and the 
35 approximately 5 kbp human DNA Insert was Isolated. The 
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S£je was done with pAC344 DNA» The pxxrlfled inserts were 
further separately digested with Rsal, and the products jf 
the restriction digest was electrophoresed in a 1% agarose 
gel* The electrophoretic pattern is shown in Figure 4. 
5 Lane (M) contained molecular weight markers (lambd.. Hindlll 
an^. phi-x Haelll fragments ^» Lane (a) contained 
restriction fragments from the insert of pAC329; and lane 
(b) contained those from the insert of pAC344, 

DNA from each of the four marked bands shown in 

10 Figure 4 was isolated and tised to probe unrelated, human 
genomic DNAs in the Southern format. The hybridization 
conditions w^e as described in Example 3, except that 
unlabelled human genomic DNA was not present in the probe 
mix. Relevant results of hybridization blots against PstI 

15 digested human genomic DNA are shown in Figure §• The 
general conclusion is that DNA from each of Bands 1, 2, 3 
and 4 are useful as multi-locus, polymorphic probes. 

Figure 5A shows the hybridization resxats when DNA 
from Band 1 of Figure 4 was used as the probe. Under the 

20 above-indicated hybridization conditions, if the probe 
contained highly repetitive sequences such as the "Alu 
sequences", a background signal in addition to any 
specific bands might be produced, because the probe would 
have hybridized to homologous highly repetitive sequences 

25 which are widely dispersed throughout the human genome. 
Such background signal is indeed indicated in Figure 5A. 

Figure 5B shows the hybridization results when DNA 
from Band 2 of Figure 4 was used as the probe. Otherwise, 
the target and the hybridization conditions were identical. 

30 In contrast with Figure 5A, background signals which are 
indicative of the presence of highly repetitive sequences 
In the probe are not as evident here. 

Similarly, Figures 5C and 5D show the respective 
results when DNA from Bands 3 and 4 were used as the 
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rewpectlve probes. Figure 5D shows rather less background 
signal than Figure SC. 

This BKample shows tl at the subsequences of pAC329 a ;d 
pAC344 "represented by Bands 2 and ' are Improvements over 
5 PAC329 and pAC344 as multi-locus, polymorphic probes. 

DNA from Band 4, I.e., tl.a second largest restriction 
fragment from Rsal digestion of the human DNA Insert In 
PAC344 was cloned Into whe Smal Inserdon site of 
Bluescrlpt by "blunt end cloning". The recombinant 
10 molecule has been- designated pAC365. 

Example 6 - Analysis Of pAC365 

A restriction map of pAC365 DNA was obtained using a 
standard method. The results of the mapping Is shown In 

15 Figure 6. The human sequence can be excised from the 
recombinant with EcoRI and BamHI In the form of a 1.35 kbp 
(approx.) fragment. The human sequence Is flanked by 7 
nucleotides on one end and 11 on the other both from the 
polyllnker region of Bluescrlpt. See Figure 6. 

20 The 1.35 kbp fragment was excised and used to probe 

PstI digested human genomic DNA Identically prepared as In 
Example 5, and under Identical conditions. The results of 
the hybridization Is shown In Figure 7-. A comparison of 
Flgixre 5D with Figure 7 shows that the signal patterns In 

25 the two blots were Identical. This shows that the DNA 
sequence from Band 4 of Figure 4 was successfully cloned 
Into pAC365. 

The 1.35 kbp fragment was digested with BstNI and the 
restriction digest was electrophoresed in a 3% NuSleve 

30 agarose gel. [NuSieve is a trademarked product of FMC 
Corporation] . The electrophoretic pattern is shown in 
Figure 8. Lane M of Figure 8 contained phi-x Haelll 
molecular weight markers. The BstNI digest of the 1.35 kbp 
fragment shows a broad band at about 70-100 bp (indicated 

35 by an arrow in the Figure) and a band at a higher molecular 
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weight of approximately 0.9 kbp. The sum of the molecular 
weights pertaining to these two bands is below 1»35 kbp. 
Therefore, the results indicate that the 1.35kbp fragment 
comprises several copies of an approximately 70-100 bp VTR 
5 •'core" sequence* 

Example 7 Analysis Of The Subfraqments Of The pAC365 
Insert 

The 1.35 kbp human sequence cloned into pAC365 was 

10 excised from the plasmiid along with 18 nucleotides from the 
polylinker region of Bluescript by digestion with EcoRI and 
BamHI. The excised fragment is further digested with one 
of the following enzymes: PstI, Xbal, Sau3A I, Hinfl, and 
Ncol, thus generating five subfragments that each had «ie 

15 EcoRI site of l«35kbp fragment at one end. 

Each of these five subfragments were tested for its 
utility as multi-locus, polymorphic probes. Each is 
hybridized to a replicate panel of PstI digested, 
unrelated, human genomic DNAs. The hybridization 

20 conditions were as described in Exam-pie 5. Parts of the 
results are shown in Figure 9. Figures 9A, and 9B show the 
results where EcoRI-PstI, and EcoRI-NcoI subfragments 
respectively, were used as the probe. It is readily seen 
that each of these subfragments generated the same pattern 

25 as the full pAC365 insert. Compare Figure 7 with Figures 
9A, and 9B. Fuirthermore, the other subfragments also 
yielded the identical banding pattern. Hence, only the 
resiats for the largest (EcoRI-PstI) and the smallest 
(EcoRI-NcoI) subfragments are shown. 

30 Finally, the DMA in the 100 bp area of Figure 8 was 

used as probe , and wa s able to generate an identical 
pattern on a replicate panel of target human DNA. This is 
further evidence that pAC365 contains a VTR sequence. 

This Example shows that each of these subfragments are 

35 useful multi-locus, polymorphic bands. 
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Example 8 — Hybrid ' ''ormation Ani Stability Under High and 
Low Stringency Conditions 

The 1.35 kbp Insert of pAC365 was used to probe a 
5 panel of PstI digested htooan genomlo DNA. The results are 
shown In Figure 10. Figure lOA shows a Southern blot 
where hybrid formation took place at 50 *C, In 5X SSPE, and 
the blot was washed at 65*C In 2X SSC. Figure lOB shows a 
duplicate blot except It was washed at 65*^0 In O.IX SSC* 
10 No unlabeled total human genomlo DNA was added to the probe 
mix. 

It Is believed that the probe contained a small 
amount of Alu-llke sequence, and this accotmts for the 
background signal. More significantly, the background Is 

15 dramatically lower In Figure lOB than In Figure lOA. Most 
Importantly, the specific signals. I.e. , the polymorphic 
bands are strong and unambiguous when the blot was washed 
under high stringency conditions. 

When hybrid formation and washing take place under 

20 more stringent conditions, background suppression Is 
further Improved* Figure IOC shows a blot where hybrids 
were formed at 65 *C In 5X SSPE, and where the blot was 
washed at 65 *C In O.IX SSC. The hybrids were actually 
stable at the even more stringent conditions: O.OIX SSC at 

25 65*C. 

These results can be jxixtaposed with those obtained 
using pAC256 as a probe. pAC256 Is a Bluescrlpt 
recombinant containing a human Insert sequence which 
contains a Variable Tandem Repeat Sequence. See McClaln et 

30 al.. Am. J. of Human Genetics 41(Suppl. ):A259 (1987). 
pAC256 behaves In conformity with prior art expectation. 
Prior art teaches that even though VTR probes can detect 
many loci at low stringency, the probes can only Identify a 
single locus at high stringency. See, for example, 

35 Nakamura et al. (1987), Science 235: 1616, at 1618. 
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Figxire 11 illustrates Just this point. Genomic human 
DNA were extracted, digested with PstI, and hybridized in 
Che southern fori«at to pAC2E: human insert at 65 in 5X 
SSPE. Figure llA shows the results when the blot was washed 
5 at 65** C in 2X SSC, Figure IIB shows the results when the 
blot vtBS instead washed at 65*C in O.IX SSC. It is readily 
seen ttiat many of Hdie bands detected under the less 
stringent conditions were unstable under the more stringent 
conditions and were washed away* The fact that no more 
10 than two bands were seen in each of the lanes containing 
target human DNA is consistent with the notion that at most 
a single locus - the locus from which the probe sequence 
originated • can be detected. 

15 Example 9 — Equivalency Of Various DNA Sequences As 

Polymorphic Probes 

The following polynucleotide sequences were tested for 

differences, if any, in terms of the ability to detect 

polymorphism at multiple loci: DNA contained in Band C of 
20 Figure lA (*Band C DNA"); 19-MSP (defined below); the 

insert of pAC365; and the 5 subfragments described in 

Example 7. 

19-MSP was obtained from "Band C DNA" by digesting the 
latter with Mspl* "Band C DNA" appears to contain more 

25 than one species of DNA sequences of about 5 kbp. One 
species is cut into several smaller pieces by Mspl. This 
species does not detect polymorphism. The other species is 
also cut by Mspl, but the larger digestion product is only 
slightly smaller than 5 kbp. This approximately 5 kbp 

30 product is designated 19-MSP. 

19-MSP and "Band C DNA" detected the same polymorphic 
bands when tested against genomic DNA from more than twenty 
genetically unrelated individuals. "Band C DNA" was able 
to detect, in addition to these polymorphic bands, several 

35 non^polymorphic bands. These results are illustrated in 
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Figure 12. Figures 12A and 12B show Southern blots probed 
respectively with "Band C DNA" and 19-MSP, but are 
othezrwlse prepared In duplicate. Each blot contained PstI 
digested genomic DNA from genetically unrelated 
5 Individuals. When one of test lanes r for example, the 
rightmost test lane In Figure 12A Is compared with Its 
counterpart, the rightmost test lane of Figure 12B, It Is 
seen that the former contain extra bands which are marked 
with arrows. However, these are non-poIymorphlc bands, as 

10 evidenced by the appearance of the same sized bands In all 
other test lanes In Figure 12A. These results show that 
"Band C DNA" and 19-MSP recognize the same DNA 
polymorphisms « 

19-M5P and the Insert of pAC365 detected the same 

15 polymorphic bands when tested against genomic DNA from 107 
gent :lcally unrelated Individuals. These results show 
that the Insert of pAC365 and 19-MSP are equivalent probes. 

Similarly, each of the five subfragments described In 
Example 7 have been shown to be equivalent to pAC365. 

20 (Representative data shown In Figures 7 and 9). 

Example 10 - PAC365 Detects Multiple Loci Which Segregate 
Independently In the Mendellan Fashion 

Genomic DNAs were extracted from Individuals belonging 

25 to families spanning three generations, digested with PstI, 
and probed with pAC365 Insert In the Southern Format. 
Flgiire 13 shows the results of one such family study. The 
family tree at the top of the figure Indicate the source of 
DNA In each of the test lanes. The lanes marked (m) 

30 contained molecular weight markers. 

A total of 27 bands of varying sizes were detected by 
hybridization with pAC365. These are partially numbered on 
the side of the figure for Identification. Table 1 shows 
the phenotype of each Individual. 
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Several conoliisions can be drawn from these resxats. 
Fixst, each and every ban'1 which is present In any one of 
taie eigHt children Is also present in either the father or 
the mother. This result is consistent with statle 
5 chromosomal inheritance. Sirtlarly, every band which is 
present in the father io present in either th paternal 
grandfather or grand-mother. 

Second , the results are consistent with independent 
Mendelian segregation of alleles present on sister 

10 chromosomes. For example/ Bands 1 and 4 appear to be two 
alleles to the same gene. The mother has both of these 
alleles, presumably one on each sister chromosome. 
However, each of the eight children inherits one or the 
other allele, but never both. Thus, the mode of 

15 inheritance is consistent with independent Mendelian 
segregation. Similarly, Bands 10 and 17 in the father 
segregate in a manner which is consistent with the notion 
that they are two alleles on the same gene. 
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Table 1 , Inheritance And Segregation Of Allele Within A Famll y* 

Family 
Member 



12 3 45678 

Bands 

1 ^ + 

2 - + 

3 + + + + + + - + 

4 - + + + + - + + + -- -- 

5 +------------ 

6 + - + + + + + - 

7 - + - + - + + + - 

8 +------------ 

9 - + + + + + + + - 

10 ---- + + + + -- + - + 

11 + + - + + -- + -- + + - 

12 + + 

13 - + + + - + - 

14 + + + + + + + + + + + - + 

15 + - 

16 ------------^ 

17 -- + + --._- + + + + - 

18 + 

19 + + 

20 + _ 

21 + - 

22 -._- + + -- •-. + + - + 
23 

24 + + + + + + + + + + -- - 

25 -- + -- + + + + - + + - 

26 + + + + + + + + + + + + + 

27 + + + + + + + + + + + + + 



* No data from maternal grandmother. 
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Third, the resul-ts are consistent: with the notion 
that Band 1 -'s not linked to an^ of Band 7, 9, 13 or 19. 
These bandL are all present in the mother. However, child 
#8 inhe: ited Band 1, but not Band 7 from the mother. 
5 Therefore, Band 1 is not linked to Band 1. Child #1 
Inherited Bands 9, 13 and 19, but not Band 1 from the 
mother • Therefore, B?»nd 1 is not linked to Band 9, 13 or 
19. 

Similar analyses lead to the conclusion that Band 7 is 
10 not linked to any of the other Bands present exclusively in 
the mother, nor Band 9, nor Band 13 and 19. Bands i, 7, 9, 
13^ and 19 represent five genetic loci. 

Example 11 Population Genetics and Allele Frequency 

15 DNAs from 423 genetically unrelated individuals were 

tested in this study. Each DNA sample was digested with 
PstI and probed in a Southern h^ridization procedure. For 
127 samples, the insert from pAC365 was used as the probe. 
For 1±ie other 296 samples, 19-*MSP was used as the probe. 

20 However, it had been established that the two probes 
recognize the same DNA polymorphisms. Therefore, pooling 
of results obtained by using these probes is valid. The 
pooled results have been sorted according to their ethnic 
origins (i.e., American Blacks, Caucasoids, and 

25 Hispanics). The frequency v. allele size distributions 
are shown in Figure 14. Figures 14A, 14B, and 140 show the 
distributions in American Blacks, Caucasoids, and Hispanics 
respectively. The y-axis is measxired in per cent, and the 
X-axis is measured in kilobase pairs. The frequency 

30 distributions are more fully set forth in Table 2. 



wo 90/02818 



PCTAJS89/03473 



33 



Table 2 , Allele Frequencies For Three Racial Groups** 

Fragment Size (kilobases)*** 
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3.45 




BXaCKS 


1 


1 


9 


6 


7 


15 


8 


3 




Caucasolds 


1 


0 


1 


1 


1 


1 • 


2 


7 




Hlspanlcs 


2 


2 


9 


2 


2 


1 


2 


6 






3.50 


3.55 


3.60 


3.65 


3.70 


3.75 


3.80 


3.85 




Blacks 


5 






XD 






o 
o 


1 A 
X4 




Caucasolds 


2 


10 


17 


10 


27 


27 


11 


19 




Hlspanlcs 


3 


6 


15 


19 


17 


35 


17 


15 






3.90 


3.95 


4.00 


4.05 


4.10 


4.15 


4.20 


4.25 


4.3 


Blacks 


17 


5 


9 


5 


10 


14 


13 


7 


7 


Caucasolds 


23 


6 


5 


10 


13 


13 


44 


17 


12 


Hlspanlcs 


26 


5 


9 


7 


12 


15 


49 


8 


6 




4.35 


4.40 


4.45 


4.50 


4.55 


4.60 


4.65 


4.70 


4.75 


Blacks 


3 


15 


6 


5 


9 


16 


10 


7 


13 


Caucasolds 


8 


7 


8 


6 


10 


24 


8 


11 


13 


Hlspanlcs 


12 


6 


6 


8 


11 


20 


6 


14 


9 




4.80 


4.85 


4.90 


4.95 


5.00 


5.05 


5.10 


5.15 


5.20 


Blacks 


10 


13 


14 


9 


13 


15 


9 


8 


7 


Caucasolds 


8 


8 


21 


12 


12 


15 


20 


9 


12 


Hlspanlcs 


7 


6 


18 


8 


8 


15 


20 


9 


7 
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5.25 

Blacks 6 
Caucasoids 5 
Hlspanlcs 5 

5.70 

Blacks 10 
Caucasoids 7 
Hlspanlcs 11 



6.3 

Blacks 3 
Caucasoids 4 
Hlspanlcs 11 



7.2 

Blacks 13 
Caucasoids 5 
Hlspa&ilcs 11 

8.1 

Blacks 8 
Caucasoids 1 
Hlspanlcs 1 



9.0 

Blacks 3 
Caucasoids 2 
Hlspanlcs 1 



9.9 

Blacks 2 
Caucasoids 1 
Hlspanlcs 2 
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5.45 


7 


4 


5 


2 


12 


8 


8 


4 


9 


13 


2 


5 


5.75 


5.aU 


5 .05 


K on 


10 


6 


3 


3 


10 


8 


3 


12 


12 


10 


9 


4 


6.4 


6.5 


6.6 


6.7 


4 


8 


12 


5 


8 


9 


12 


9 


7 


6 


6 


6 




7.4 


7.5 


7.6 


4 


6 


5 


5 


2 


0 


2 


4 


6 


1 


7 


1 


8.2 


8.3 


8.4 


8.5 


3 


1 


5 


3 


2 


1 


2 


2 


5 


2 


1 


1 


9.1 


9.2 


9.3 


9.4 


1 


2 


3 


3 


1 


2 


4 


1 


2 


2 


1 


1 


10 


10.2 


10.4 


10.6 


3 


3 


0 


0 


2 


7 


7 


2 


2 


6 


2 


2 



5.50 


5.55 


5.60 


5.65 


6 


8 


5 


5 


7 


4 


5 


5 


3 


7 


6 


6 


R OR 


A n 
o. u 


fi 1 




3 


7 


4 


6 


2 


4 


11 


14 


8 


12 


10 


12 


o.o 


0.9 




*7 1 
/ .i 


7 


7 


1 


9 


4 


8 


8 


5 


8 


3 


2 


6 


7.7 


7.8 


7. y. 


o. U 


4 


2 


3 


4 


2 


0 


1 


1 


5 


4 


2 


0 


O.O 




o.o 


n Q 

O • 9 


4 


1 


2 


3 


1 ■ 


1 


2 


2 


0 


0 


4 


1 


9.5 


9.6 


9.7 


9.8 


3 


7 


3 


0 


2 


1 


1 


2 


0 


0 


0 


■ 1 


10.8 


11.0 


11.2 


11.4 


5 


4 


3 


0 


5 


7 


2 


1 


4 


6 


5 


3 
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11.6 


11.8 


12.0 


12.2 


12.4 


12.6 


12.8 


13.0 


13.: 


Blacks 


1 


0 


2 


1 


2 


2 


1 


1 


1 


Caucasoida 


2 


1 


2 


1 


1 

J. 


1 

X 






X 






n 


o 
u 


X 






U 


U 


n 
U 




13.4 


13.6 


13.8 


14.0 


14.2 


14.4 


14.6 


14.8 


15.0 


Blacks 


1 


0 


0 


1 


0 


0 


1 


0 


1 


Cauca^ olds 


1 




r\ 


i 

X 


u 


X 


0 


0 


0 


Hlspanlcs 


1 


1 


0 


1 


1 


0 


0 


0 


0 




15.2 


15.4 


15.6 


15.8 


16.0 


16.2 


16.4 


16.6 


16.8 


Blacks 


3 


0 


1 


1 


2 


0 


1 


0 


0 


Caucasoids 


1 


0 


0 


0 


0 


1 


0 


0 


1 


Hlspanlcs 


1 


1 


0 


0 


2 


0 


0 


0 


0 


17.0 


17.2 


17.4 


17.6 


17.8 


18.0 


18.2 


18.4 




Blacks 


5 


0 


0 


0 


1 


3 


0 


0 




Caucasoids 


0 


0 


0 


0 


0 


1 


0 


0 




Hlspanlcs 


2 


1 


0 


0 


1 


3 


0 


0 






18.6 


18.8 


19.0 


19.2 


. 19.4 


19.6 


19.8 


20.0 




Blacks 


0 


0 


2 


0 


1 


0 


1 


1 




Caucasoids 


0 


0 


2 


0 


1 


0 


0 


0 




Hlspanlcs 


0 


0 


2 


0 


1 


0 


0 


3 





** Allele frequencies are stated In per cent In Table 2. 

*** The standard error for fragment size Is approximately 0.6% 

of the size of the fragment. Therefore, DNA fragments 
whose sizes are within 2% of each other (3 standard 
deviations) are considered Indistinguishable. 
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Egample 12 Characterization of pAC365 

Genomic DNA from various human cell lines were 
extracted^ digested with Pstl^ and hybridized with pAC365 
Insert in the Southern format- 7024, 7351, 7047, 7432, 
5 7433 and 7015 were obtained from Centre d * Etude du 
PolYmorphlsme Humaln in France. 1202 was obtained from 
the National Institute of General Medical Sciences (NIOIS) 
Human Genetic Cell Repository (Catalog Number 1202B). It 
Is a lymphoblast cell line with 49 chromosomes (XXXXY). 

10 CEM and Jurket are T lymphoblastold cell lines » K562 is a 
erythroleukemla cell line and HL60 is a promyelocytic cell 
line. CEM, K562, arid HL60 can be obtained from the 
American Type Culture Collection ("ATCC") under ATCC 
catalog numbers CCL119, CCL243, and CCL240, respectively. 

15 The bands detected in the Southern blot are set forth 

in Table 3 below. For example, pAC365 insert detected 9 
bands ranging from 7.1 kllobase pairs to 2.2 kilobase pairs 
when hybridized with PstI digested K562 cell DNA. The 
banding pattern obtained for each cell line is unique. 

20 Therefore, when used for probe purposes polynucleotide 
sequences can be characterized, or "fingerprinted* by the 
banding pattern with known target DNA. 
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Table 


3 . " Fingerprint " 


of PAC365 












7024 


7351 


OEM 


1202 


HL60 


Jur. 


7047 


7432 


7433 


7015 


K562 


6.3# 


6.5 


20 


11 


10.7 


10.6 


9.6 


9.7 


9.3 


6.5 


7.1 


5.3 


5.6 


7.7 


.3 


6.6 


5.2 


9.2 


9.3 


5.7 


5.7 


6.2 


4.9 


5.3 


6.1 


5.8 


5.0 


5.1 


7.1 


6.4 


5.6 


5.4 


4.4 


4.4 


5.0 


5.0 


4.6 


4.6 


4.9 


5.7 


5.7 


5.1 


4.9 


4.2 


4.2 


4.9 


4.6 


4.2 


4.5 


4.7 


5.4 


5.4 


4.9 


4.4 


4.1 


3.7 


4.2 


4.1 


2.6 


4.3 


4.3 


5.2 


5.2 


4.2 


4.1 


3.7 


3.6 


4.1 


4.0 


2.3 


3.8 


4.2 


4.9 


5.1 


3.8 


3.6 


3.6 


2.6 


3.8 


3.8 


2.2 


3.5 


4.0 


4.7 


4.7 


3.7 


2.6 


2.6 


2.3 


2.6 


3.6 




2.6 


2.6 


4.2 


4.6 


2.6 


2.2 


2.2 


2.2 


2.2 


2.3 




2.2 


2.3 


3.9 


3.6 


2.2 ' 










2.2 






2.2 


3.7 


2.6 









2.7 2.2 
2.2 

# Sizes of the fragments detected are stated in kbp in Table 
3. 



Example 13 Paternity Testing and Forensio Testing 

Genomic DNAs were extracted from a child, the mother 
of the child and the alleged father. The DNAs were 
digested with PstI, electrophoresed and transferred for 
Southern hybridization. The DNA targets were probed with 
19-MSP. Figure 15A shows the results of the blot. Lanes 
labeled (ra) contained molecular weight markers. Lanes (a), 
(b), and (c) contained DNA from the mother, the child, and 
the alleged father of the child respectively. Lane (d) 
contained a mixture of the child's DNA and the alleged 
father's DNA. The last lane often helps to resolve 
ambiguity whenever a band detected in the child's lane is 
close in size to a band detected in the alleged father's 
lane. In such a case, the presence of a singlet band in 
the relevant size region in the "child plus alleged father" 
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lane would tend to Indicate a common allele; and a doublet 
band would Indicate distinct alleles* 

Figure ISA shows that at least 5 bands in lane (b) 
were not inherited from the mother because they are not 
5 pres. nt in lane (a>» However, each of these bands (marked 
with arrows) are present in lane (b). Therefore, the 
evidence supports the theory that the alleged father is 
indeed the biological father. 

DMAs were extracted from a rape victim, semen fotind on 

10 the victim and from a suspect of the crime. The DMAs were 
digested with PstI, and subjected to the Southern 
hybridization procedure* 19-MSP was used as a probe* 
Figure 15B shows the results of the hybridization blot. 
Lane (a) contained DNA from the victim. Lane (b) and (c) 

15 contained DNA from semen found on the victim, and from a 
suspect, respectively. At least 8 bands (marked with 
arrows) in lane (b) do not match the bands in lane (a), 
clearly indicating that these bemds did not arise from 
cells of the victim which some-how contaminated the semen 

20 sample. However, all 8 bands matched bands of the same 
sizes in lane (d). Therefore, the evidence that the semen 
came from the suspect was exceedingly strong. 

Example 14 Relationships Among The Polymorphic Probes Of 
25 The Invention 

Figure 16 shows the relationships among the various 
DNA sequences of the present invention, which are useful 
as polymorphic probes. 

30 DEPOSIT OF MICROORGANISM 

Many polynucleotide sequences may be used to practice 
the present invention. Exemplary of such sequences are 
human genomic sequences which have been cloned into 
recombinant plasmids designated pAC329, pAC344, and pAC365. 
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Flqure 16 shows the relationships among the cloned 
sequences of this invention 

An E. coll strain HBlOl carrying the plasmld pAC329, 
an E. coli stxain HBlOl carrying the plasmld pAC344, and an 
5 E* coli strain HBlOl carrying the pAC365 plasmld ^^ve been 
deposited with the Agricultural Research Culture 
Collection (NRRL), Peoria, XL, on September 2, 1988, and 
have been assigned accession numbers NRKL B*18403, NRRL B- 
18404, and KRRL B-18405, respectively, 

10 The subject cultures have been deposited under 

conditions that assure that access to the cultures will be 
available during the pendency of this patent application to 
one determined by the Commissioner of Patents and 
Trademarks to be entitled thereto under 37 CFR 1.14 and 35 

15 U.S.C. 122. The deposits are available as required by 
foreign patent laws in countries wherein counterparts of 
the subject application, or its progeny, are filed. 
However, it should be understood that the availability of a 
deposit does not constitute a license to practice the 

20 subject invention in derogation of patent rights granted by 
governmental action. 

Further, the subject culture deposits will be stored 
and made available to the public in accord with the 
provisions of the Budapest Treaty for the Deposit of 

25 Microoxganisms, i.e., they will be stored with all the care 
necessary to keep them viable and uncontaminated for a 
period of at least five years after the most recent request 
for the furnishing of a sample of the deposit, and in any 
case, for a period of at least thirty (30) years after the 

30 date of deposit or for the enforceable life of any patent 
which may issue disclosing the cultures. The depositor 
acknowledges the duty to replace the deposits should the 
depository be unable to furnish a sample when requested, 
due to the condition of the deposits. All restrictions on 

35 the availability to the public of the subject culture 
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deposits will be irrevonably removed upon the granting of a 

patent dislcosing them. 

E> -TOli HBlOl is available frcn the NRRL repository 

where its accession number is NRRL B-11371. Plasmids Cc^ 
5 be isolated from the col:' host by use of standard 

procedures, e.g., using cleared lysate-isopycnl, ; densily 

gradient procedures, and the Mke. 

The present invention is not to be limited in scope by 

the microorganisms deposited, since the deposited 
10 embodiment is intended as a single illustration of one 

aspect of the invention. Many variations of this 

invention as herein set forth may be made without 

departing from the spirit and scope thereof. The specific 

embodiments described are given by way of example only, and 
15 the invention is limited only by the terms of the appended 

claims. 
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SOBXPSST TTTATT OH rSt nCTaKAnONXi 

KCSSQizrrss or tsx c^toszt o? mzszoohcxnxsks 
roa tss 7C3FC-SS or past pbccxsuu 



lyaUMA gSOmJ. FORM 



Dr. biancy Ip 
Llfecodes O or p o ration 
Saw Kill River Boad 
Vall-ill, NT 10595 



1 



f ^ ffs r^'p XH TflE cxsz or ah oricikal oetosi 
isaiwd porsiunc co Ruia T. l by 

issBtnsxoKAL DDoszauor joRsoBZty 

ldea«lfl«d mv tli« bocsoa of ci&xs paga 



L 



SMS AHO A DDRp S 
or DC90SZ?0R 



J 



I, rcBsracpxcxsioii or the. mcaoowaatSH 


Idaii«.iie*«lon tntrnxmnem ^ymn ty ei« 
OCPOSZTORs 

Escherichia coli HBlOl (pAC3 29) 


Accttsaxoa aoBDar glvea hy th« 
nnSSHXrXOKAL OEPOSZTART AUTBOBZrSt 

NRRL B-18403 




r:. scrannc pr'i^ Jiu-fioM aho/or pboposed »d»omic oesxciatiom 



alcroorg»nla» ld«i*lll«d imdar I abov «u *cco«»*nled brt 
p~j a ■dimrtgle dMcrtp«lon 

PYI » propo..d «w»on4c d^yna^a 
(Bask with a eroaa vhars a gp U ca bla) 
a. Pf ' ' ' " "^ AHD A CSflrEABC 

. **«fM tiha ml tiiour g a nm id«Tirtfl*td wdor I abovai 



-aa., Agricultural Reseiarch Culture 

Collection QflSSL} 
ItttemacionAl Depositary Authority 
juidrawi 1815 H. Univeraity Street 

Peoria, Illinois 6160^ U.S.A. 



to repr*aan« cha Ia««*^<»^ »«P?**:r*^ 
Xuxixortty or of aut 



r&sad official (a) x 



Oaui 




SUBSTITUTE SHEET 
ISA/EP 
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»f-yL7±ST OJt tHT r 



UJecodss rorsorat^cti 
Saw Mill River Boad 
Valhall. NT 10595 



RSCSTTT DI TTO OLSr or AM OJUCIHXL O2?0«r 



L 



jyOtS AMD AOOJWS 
OF DSSQSTZOK * 



J 



z. 



Escherichia ccli HBlOl (pAC3 44) 



tronncxrsoM or « »ccaoow»«SK 



Xeeusloa aaab«r gtwn by eh* 
IStSSHXZXOtlAL OSPOSZrUtf AOTBOXXrr: 

NRRL B-ia40 4 



-Y] A proposed caacofuwic d««l9n*taoa 
(Uark with * oso*» .ppUcablel 



^ rT3Zr*««fl« th. aleroor9«UL« ld«nc4ll«d oador I mbove. 



jHxriaatfioHAt oggosnxac AOgoHrrr 



Agricultural Research Culttire 

CoUecCion OTRRL) 
lacamaclonal Depoaicary Auchoricy 
Addro... 1815 t^. ttaiveraity Screec 

Peoria^ IlUnoia 61604 tr.S.A* 



Sltm«ttxr«(sl of yanoaCs) haTln? th« power 
StSrtJJ o£ 4Et«ort«d ofltci^(.l i 



laemm&eloaal depodeacy «ueboclty» 



toza BP/4 (sol* P«9«^ 



SUBSTITUTE SHEET 
ISA/EP 
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Hscocaiion or the o»<»" or miowor^misks 
roB va rmosts or f asht fkoocoom 



mrsBiixTSONU. rowt 



r ' 1 
Dr. Nancy Ip ^ ' 

Lifecodes Corporation 
Saw Mill River Road 
Valhallr 10595 



RTCSIPT IM THt CXSr OT AM ORICIMAL DEPOSIT 
IsatMd puzsuaac to Rola 7.1 by 
XtnSSlUXXQMAL OEPOSnXRT AOTHOKITT 
ldaav«,£lad «»• bo«o of tftia p«9« 



L 



NAIE AMD AOOnESS 
or DEPOSITOR 



J 



I. iDEMtiriexsioH or the kicworca»ism 


Idantlfieacion r«f aranco by th« 

DSPOSETOAs 

Escherichia coli HBlOl (pAC365) 


Aceaaalon aoabar ffX,v«n by tha 
XHTXRHATIOinkL OEPOSZTARX AOTKORZT^Ct 

HRRL B-18A05 






n. sdzsTinc oESCnirttoa abo/or proposed txxwohic oesichatiom 


t&« mtcroor^anlan ldea«l£i«d widar I *boira i«a »ccamp«Jil«d byt 
a sclentiLfle doacrtptlan 


["Y] * proposed trnxanomXc A^sXtfUA^Xon 
(hBxk wi«h « eroaa wtMra applleablo) 






wtileh w«a xaeaXTad by 1« onSep-.2,19b5 (data o 


tb« Bicrooryaniaa ideaclilad ondar X abora* 
t tha orlfloal dapoaitt^ 
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Collection QXBSL) 
latemaclonal Depot It ary Author! rr 
M^^Tt' 18^5 Ottlv«rslcr Street 

Peoria, TUlaois 61604 U.S.A. 


Signature (a) oi partonUl ba7l»9 P***'*^ 
to rapraaant tha Inearnatlonal Oapoaltary 
Aotftortty or o£ «3&ion»«d oiSidaKtM 
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rers BP/4 (sela pa^a) 



SUBSTITUTE SHEET 
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Claims 

1. A polynucleotide sequence capable of forming 
hybrids with genomic Variable Tandem Repeat DMA sequences 



3 at multiple, polymorphic genetic loci of a eukaryote under 

4 hybridization conditions wherein said hybrids ate stable 

5 xm^er high s cbringency conditiom,. 

1 2. The polynucleotide sequence of claim 1 wherein the 

2 eukaryote is a mammal « 

1 3. The polynucleotide sequence of claim 2 wherein the 

2 mammal is a human* 

1 4. A polynucleotide sequence represented by the human 

2 DNA <nsert in pAC329. 

1 5. A polynucleotide sequence represented by the human 

2 DNA insert in pAC344. 

1 6. A polynucleotide sequence represented by the 

2 restriction fragment of about 1.35 kilobase pairs produced 

3 by digestion of pAC365 with EcoRI and BamHI. 

1 7. A polynucleotide sequence represented by the 2 
restriction fragment of about 0.55 kilobase pairs produced 3 b y 
digestion of pAC365 with EcoRI and Ncol. 

1 8. A discrete polynucleotide subsequence of the 

2 polynucleotide sequence of any of the claims 4 to 7, 

3 wherein said subsequence is capable of forming hybrids with 

4 genomic DNA at one or more polymorphic genetic loci of a 

5 eukaryote, and mutational, insertional or deletional 

6 variants thereof- 
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1 9. The discrete p'^lynucleotide subsequence of c^aim 8 

2 wherein the eukaryote is a human. 

1 10. A polynucleotide sequence capable of forming a 

2 hybrid with the polynucleotide sequence of any of the 

3 claims 4 to 7, wherein the hybrid is stable under high 

4 stringency conditions. 

1 11. A polynucleotide sequence capable of forming a 

2 hybrid with the discrete polynucleotide subsequence or 

3 with its variants of claim 8, wherein the hybrid is stable 

4 under high stringency conditions. 

1 12. A polynucleotide sequence capable of forming 

2 hybrids with genomic DNA fragments, produced by complete 

3 digestion of Caucasoids*, American Blacks' and Hispanics' 

4 genomic DMAs with the restriction endonuclease Pstl, of 

5 approximate lengths and approximate allele frequencies as 

6 described in Table 2. 

1 13. A polynucleotide sequence useful for detecting 

2 pol3^orphism at multiple genetic loci^ and characterized by 

3 its ability to form hybrids with restriction fragments of 

4 DNA, of 7.1, 6.2, 4.4, 4.2, 4.1, 3.7, 3.6, 2.6 and 2.2 

5 kilobases, produced by PstI digestion of genomic DNA 

7 extracted from K562 cells. 

1 14. A recombinant vector comprising the polynucleotide 

2 sequence of any of the claims 4 to 7. 

1 15. A recombinant vector comprising the polynucleotide 

2 subsequence of claim 8. 



1 

2 



16. A recombinant vector comprising the polynucleotide 
sequence of claim 10. 
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1 17 • A recombinant vector comprising the polynucleotide 

2 sequence of claim 11. 

1 18. A recombinant vector comprising the polynucleotide 

2 sequence of claim 12 or 13. 

1 19, A cell containing the recombinant vector of claim 

2 14. 

1 20. A cell containing the recombinant vector of claim 

2 15^ 

1 21. A cell containing the recombinant vector of claim 

2 16. 

1 22. A cell containing the recombinant vector of claim 

2 17. 

1 23. A cell containing the recombinant vector of claim 

2 18. 

1 24. A method for obtaining the polynucleotide sequence 

2 of claim 1 comprising: 

3 (a) hybridizing a library of genomic sequences of an 

4 organism under hybridization conditions with a 

5 probe comprising a natural or consensus sequence 

6 of a Variable Tandem Repeat; 

7 (b) selecting library sequences of said library which 

8 hybridize with said probe; 

9 (c) testing and further selecting from the library 

10 sequences selected in step (b) a library sequence 

11 which is capable of detecting Variable 

12 Tandem Repeat Length polymorphism at multiple 
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13 genetic loci of said organism or a different 

14 org misro ur ier high stxingency conditions. 

1 25. A method for obtaining the polynucleotide sequence 

2 of claim 1 comprising: 

3 (a) hybridizing a library of genomic sequences of an 

4 organism unuer hybridization conditions with a 

5 probe comprising a natural or consensus sequence 

6 of a Variable Tandem Repeat; 

7 (b) selecting library sequences of said library which 

8 hybridize with said probe; 

9 (c) producing subsequences of said selected library 

10 sequences by enzymatic means; 

11 (d) testing and selecting from the subsequences 

12 produced in step (c) a subsequence which is 

13 capable of detecting Variable Tandem Repeat 

14 Length polymorphism at multiple genetic loci of 

15 said organism or a different organism under high 

16 stringency conditions « 

1 26. A polynucleotide probe comprising the 

2 polynucleotide sequence of any of the claims 4 to 7, or 

3 mutational, insertional, or deletional variants thereof. 

1 27 . A polynucleotide probe comprising the discrete 

2 polynucleotide subsequence of claim 8, or mutational, 

3 insertional, or deletional variants thereof. 

1 ' 28. A polynucleotide probe comprising the discrete 

2 polynucleotide subsequence of claim 10, or mutational, 

3 insertional, or deletional variants thereof. 

1 29. A polynucleotide probe comprising the 

2 polynucleotide sequence of claim 11, or mutational, 

3 insertional, or deletional variants thereof. 
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1 30. A polynucleqtide probe comprising the 

2 polynucieotide sequence of cla^.n 12 or 13, or mutational, 

3 insertional, or deletional variants thereof. 

1 31. A method of genetic analysis comprising: 

2 fa) digesting a DNA sample with a restriction 

3 endonuclease; 

4 (b) separating the DNA restriction fragments 

5 according- to size by electrophoresis; 

6 ( c ) trans f err ing the separated DNA to a binding 

7 surface; 

8 (d) hybridizing the transferred DNA with a 

9 polynucleotide probe labeled with a signal- 

10 generating moiety, wherein the polynucleotide 

11 probe is the polynucleotide probe of claim 26; 

12 and 

13 { e ) detecting the signals generated ; whereby the 

14 pattern of signals generated provides information 

15 about the composition of the DNA sample. 

1 32. A method of genetic analysis comprising ♦ 

2 (a) digesting a DNA sample with a restriction 

3 endonuclease ; 

4 (b) separating the DNA restriction fragments 

5 according to size by electrophoresis; 

6 (c) transferring the separated DNA to a binding 

7 surface; 

8 (d) hybridizing the transferred DNA with a 

9 polynucleotide probe labeled with a signal- 

10 generating moiety, wherein the polynucleotide 

11 probe is the polynucieotide probe of claim 27; 

12 and 

13 (e) detecting the signals generated; whereby the 
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14 pattern of signals generated provides information 

15 about the composl *:ion of the DNA sample. 

1 33. A method of genetic analysis comprising: 

2 (a) digesting a DNA sample with a re: triction 

3 endonuclease; 

4 (b) separating the DNA restriction fragments 

5 according to size by electrophoresis; 

6 (c) transferring the separated DNA to a binding 

7 surface; 

8 (d) hybridizing the transferred DNA with a 

9 polynucleotide probe labeled with a signal- 

10 generating moiety, wherein the polynucleotide 

11 probe is the polynucleotide probe of claim 28; 

12 and 

13 (e) detecting the signals generated; whereby the 

14 pattern of signals generated provides information 

15 about the composition of the DNA sample. 

1 34. A method of genetic analysis comprising: 

2 (a) digesting a DNA sample with a restriction 

3 endonuclease ; 

4 (b) separating the DNA restriction fragments 

5 according to size by electrophoresis; 

6 (c) transferring the separated DNA to a binding 

7 surface; 

8 (d) hybridizing the transferred DNA with a 

9 polynucleotide probe labeled with a signal- 
ly generating moiety, wherein the polynucleotide 

11 probe is the polynucleotide probe of claim 29; 

12 and 

13 (e) detecting the signals generated; whereby the 

14 pattern of signals generated provides information 

15 about the composition of the DNA sample. 
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